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Abstract 



This work concerns physical layer collision recovery for cheap sensors with allowed variations 
in frequency and delay of their communications. The work is presented as a generic, communication 



theoretic framework and demonstrated using UHF RFID tag technology. Previous work in this area has 
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not provided recovery for more than two tags, which is shown to be possible in this work. Also presented 
£f~) • is a novel mathematical model of the tag signal, incorporating the allowed variations in frequency and 

delay. 

The main motivation is seen in the observation that random variations in frequency and delay 
make the collided signals of different tags separable. The collision recovery is done by estimating the 
sensor specific variation in frequency and delay and using these estimates in a successive interference 



cancellation algorithm and a maximum likelihood sequence decoder, to iteratively reconstruct a sensor 
signal and remove it from the received signal. 

Numerical simulations show that the estimates and proposed algorithm are effective in recovering 
collisions. The proposed algorithm is then incorporated into a numerical simulation of the Q-protocol 
for UHF RFID tags and is shown to be effective in providing fast and power efficient sensor arbitration. 
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I. Introduction 

In the current standard for UHF Radio Frequency IDentification (RFID), the protocol imposes 
a simple tag-to-reader communication to allow for a simple tag structure (T). Collisions occur 
at the reader when multiple tags simultaneously reply to a query sent from a reader. To combat 
this, a range of anti-collision, or arbitration protocols have been designed, which ensure that all 
tags at some point during arbitration are queried individually. 

Since tag production is targeting low prices, large variations are allowed on the parameters 
employed by a tag to modulate the backscattered signal. With the existing decoding techniques, 
the reader uses a coherent structure to mitigate the problem of varying tag encoding parameters 
[0. On the other hand, in this work, we analyze a new communication theoretic framework, 
in which such a variation across tags is considered an enabler allowing to differentiate tag 
information in a collided tag reply. This may be seen as cheap CDMA, where the code separation 
between tags is generated due to production tolerances. The two parameters that vary across tags 
are the link frequency and the time of reply, see Fig. [Q 

Link frequency exhibits the most significant variation. That is the frequency used by a tag 
to encode a reply during backscatter onto the amplitude modulated carrier from the reader. 
The tolerance limits for this frequency, defined in EPCglobal UHF Class-1 Generation-2 (EPC 
Gen2) flD, allow up to ±22% variation per message from the nominal link frequency, denoted 
the backscatter link frequency (BLF). Also, to allow for minimal synchronization at the tag side, 
the time where a tag initiates a reply is also allowed to vary. The range can be as large as 24/is 
for some BLFs which is equivalent to the duration of several encoded symbols. 

In this work, we show how these two parameters may be estimated and used for decoding 
multiple tag replies in a single slot. To the best of our knowledge, this is the first method to 
achieve decoding of more than two UHF RFID tags by using diversification of the tag parameters. 
In the authors show how to decode up to four LF tags using signal constellations in the I/Q 
domain. Their method assumes centralized, reader-controlled link frequency, which is not valid 
for UHF tags due to the allowed frequency variation. For UHF tags it is shown in 01 how up to 
two tags can be decoded, also using the signal constellations in the I/Q domain. This is done using 
zero forcing and successive interference cancellation. Neither work uses maximum likelihood 
sequence detector (MLSD) to improve the decoding. Other work uses multiple antennas for 
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separation of multiple tags [|5]|, (61, 0, where our work is based on single antenna systems. 
Also, a related field is tag population estimation such as in flU, flU, OH- Similar methods of 
code separation are also used in these works for estimating the number of tags in a collision. 

One contribution of this work is a detailed mathematical signal and channel model of tag to 
reader communication in UHF RFID communication (described in Section HO) and an estimator 
of link frequency and delay of a tag reply (derived in Section ITTll). This estimator is then shown 
to be useful for multiple tag decoding, when used together with the Q-protocol from EPC 
Gen2 (shown in Sections [TV] |V] and [VTl) . This protocol is used for arbitration of tags and is an 
ALOHA-based protocol, which resolves tags in slots. A slot can be either Single, Collision or 
Idle. Only when a slot is Single is a tag resolved. Using our work, we show the gain of being 
able to resolve some of the Collision slots as well. In short, our communication theoretic analysis 
of the RFID channel allows us to transform the tag synchronization diversity from a foe to a 
friend, and show that in the UHF RFID case, the variations in the sensor encoding parameters 
may be used with a MLSD to enhance the decoding of multiple tags. We show that a significant 
gain is achievable when using our method in the Q-protocol. Note that the ideas presented here 
are applicable beyond UHF RFID, to a wide range of scenarios with cheap, passive, clock-less 
tags and sensors. 

The structure of thepaper is visualized in Fig. [2] The tag resolution process is iterative because 
the parameter estimation process obtains the parameters of one tag at a time - the strongest in 
the collided signal. The process therefore consists of estimating the parameters of the strongest 
tag, decoding its data, subtracting its signal contribution and reiterating the process to find the 
next strongest tag. This continues until there are no more tags present in the residual signal. 

II. Signal and Channel Model 

This section describes the mathematical framework we have derived for representing tag 
signals and the channel model we employ to simulate their transmission over the air. We begin 
with the derivation of basis functions and the signal space representation of UHF RFID tag to 
reader communication, which is based on either FMO or Miller encoding [Q. As tag to reader 
communication is based on backscattering [11] of a carrier wave, the tag transmission signal 
should be seen as a control signal, specifying whether a tag backscatters the carrier wave or not. 

An example of the control signal for the short preamble in FMO encoding is shown in Fig. [3] 
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This example is used in the remainder of this section to explain the signal encoding. We first 
define the basis functions and signal waveforms used to generate each individual symbol. Then 
we describe the state machine that generates sequences of FMO and Miller symbols. 



A. Basis Functions for FMO and Miller Encoding 



Let M p = {m ,mi, . . . ,m NM _i},m n G {0, 1} be the data message backscattered by tag p 
after a Query, QueryAdjust, or QueryRep command, not including pre- and postamble. This 
message is the reply message in a slot during arbitration with the Q-protocol. It contains a 
16-bit random number (RN16), and so Nm = 16. To transmit this message, a tag first encodes 
it using FMO or Miller after which it is backscattered to the reader, through the channel. The 
FMO and Miller basis functions are not rigorously defined in flTJ , but the signal waveforms for 
the respective encoding schemes are specified. 

Let 0fe(t), k = 0, 1 be basis functions having support duration MT. That is (j>[(t) = for 
t < and t > MT, where M is a symbol period multiplier. For FMO, M = 1, and the basis 
functions are: 
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where the uses of rect(-) are scaled so the bases have unit energy^. The tag signal is generated 
here with ideal on-off keying. Noise and the hardware limitations do not allow such an instanta- 
neous transition, however the difference is considered negligible, as in e. g. 0, lfT2ll . For Miller, 
M = 2,4, 8, corresponding to the number of subcarrier cycles in the basis function: 
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'Notice that the basis functions do not have zero mean, i.e. they correlate highly with the readers carrier wave echo. However, 
because the tag signal has overall zero mean, as is shown later, this is not a problem. 
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The basis functions are shown in Fig. HI when evaluated for the tag dependent subcarrier period 
defined from the tag dependent link frequency as TJ iP = -jk-, where fi tP is the link frequency for 
tag p where the allowed frequency variation is included. 

B. Possible Signal Waweforms using Basis Functions 

RFID tags modulate the received waveform by alternating the control signal between two 
states: (OFF) and 1 (ON). OFF means that the tag absorbs the power it receives and ON 
means it reflects it back. The control signal is generated in two steps: 

1) Signal waveforms are derived from the encoding dependent basis functions in Eqns.[T]and 
Q] with signal levels ±|. 

2) A constant offset of ~ is added to the encoded message to create the control signal. 

In the first step, a set of signal waveforms are found as a linear combination of the basis functions. 
This is accomplished using the following signal space representation: 



= v <,p v v 



(2) 



10-10 
10-1 

This matrix we shall use in later sections as a dictionary for possible signals. Using this signal 

space representation, the possible signal waveforms are generated as: 

i 

sf- p (t) = Y,^ P (t), (3) 

where Vjj picks out an element from in Eqn. ©, (j)- l,p (t), j =0,1 are the encoding dependent 
basis functions and where the signal level is — | when the tag absorbs and \ when it reflects. 
Adding the signal level offset \ to these signal waveforms in their support duration gives the 
control waveforms illustrated in Fig. [5l 

The control waveforms now allow us to generate single symbols. The following describes 
how symbol sequences are generated using the memory in the encoding schemes. We exploit 
this memory later in the decoding, which significantly improves the decoding. 

C. Generating the Control Signal using the Inherent Encoding Memory 

An important property for FMO and Miller is the inherent memory, where the signal waveforms 
used for encoding of the symbol m n depends on the previously sent symbol m n _i. Let the signal 
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waveforms s k l ' p (t) correspond to state s k , then the state machine for FMO and Miller is in Fig. [61 
from which we obtain the symbol-dependent transition matrices H mn , m n = {0, 1}: 
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(5) 



where the (k, k')th entry equal to 1 indicates a valid transition from state s^/_i to state s fc _x. Also, 
let Sm p be a 4 x Nm state select matrix generated using H , Hi and the message Ai p . Each 
column vector s Mp:Tl in Sm p is one of the coordinate column vectors e k , where k = 1,2,3,4 
denotes the state s , s ls s 2 , or s 3 , respectively, used to encode the nth symbol in Ai p : 



S M P ,0 S M P ,1 



S M P ,N M -1 



H mo S; n i t H mi Sq 



(6) 



where s init denotes the state prior to the first symbol in M. v . This state follows from the last 
symbol in the preamble. 

An example is the state select matrix used to generate the signal in Fig. [3j 

1 1 
1 
1 
10 10 

From EPC Gen2 it is known that the state for the last transmitted symbol in the preamble is s\ 
for FMO and s 3 for Miller, and the respective initialization vectors are: 



(7) 
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The control signal waveform describing the message part for tag p directly follows as: 

n m -i 3 

°M P {t) = Yl e l+i S M p en+isl l ' p {t - nMT hp ) + y p (t), 

n=0 k=0 

= £ Y, e k+i S M,^iVe<l>l ltP (t-nMT l 0)+y J ,(t), (8) 

n=0 A:=0 



where D Mp = T^ P MN M is the duration of the data message, and %(t) = ~ rect I j adds 

the offset ensuring that the control signal has signal levels or 1. 

Pre- and postamble control signals are added to the message control signal, where the preamble 
depends on whether FMO or Miller is used for encoding. Let c pT . :P (t) be the preamble control 
signal generated with tag link frequency fi >p , and let D pr p be the support duration of the preamble. 
Also, let c po , p (t) be the postamble control signal with support duration equivalent to the duration 
of one symbol and the complete transmitted signal from tag p is: 

= Cpr,p(f) ~\~ CM,p(t D prp ) + Cp .p (t D p r,p ■ (9a) 



In the following sections an alternative representation of Eqn. (|9al) is used where the message 
and the postamble symbol are included in a combined structure. It is known from EPC Gen2 
that the postamble symbol is symbol-1 which corresponds to state s\ or s 3 depending on the 
state of the last encoded symbol in Ai p . Let the state select matrix S p be 4 x (Nm + 1) where 
the last entry is for the postamble symbol: 
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which follows from Eqn. © and Eqn. (l9al) is rewritten from an extended version of Eqn. ([8]): 

N M 3 
n=0 k=0 



c p (t) = c pTtP (t) + y^y^ e^ +1 S p e„ + is^' p (t - D pi . iP - nMTi tP ) + j p (t), 



where the support duration of 7 P (t) is increased to include the postamble symbol y p (t) 
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D. Channel Model and Received Signal 

Let y p (t) be the signal corresponding to a single tag reply where the effect of the channel 
between tag p and the reader is captured. Assuming a linear time-invariant (LTI) channel with 
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flat fading during the short period of communication: 

y p (t) = H RTRyP A T b cj,(t), (10) 

where c p (t) is the on-off key modulated square wave control signal for tag p from Eqn. ©, T b 
is the fraction of the power, which the tag is able to backscatter |fl3ll . and A is the amplitude 
of the transmitted carrier wave from the reader. H RTR p is the complex channel coefficient 
Hrtr, p = H RTp = H^ R p , which models that the channel coefficient between the reader and tag 
p, H R t, p , is the same as the channel between tag p and the reader, Htr, p , due to reciprocity. 
Hrtr,p captures fading, antenna gains, and path-loss. Then the received signal at the reader is: 



p-i 



z'(t) = Y,y P (t-Tp) + L + o(t), (ii) 

p=0 

where P is the number of tags that participate in the response, t p is the unknown random 
delay for tag p, L is the leakage from the reader's transmit antenna and the scatteres of the 
unmodulated carrier wave, and 0(t) is additive white Gaussian noise (AWGN) added at the 
reader. More specifically, the antenna leakage can be decomposed as L = H RR A + L ant , where 
L ant is the antenna leakage, and H RR is the complex channel coefficient for the reader-to-reader 
channel. We model H RR p with Rayleigh fading because the line of sight component is captured 
by the direct antenna leakage L ant . 

Note that y p (t) in Eqn. (flOl) has infinite bandwidth because of the on-off keyed control 
signal c p (t). However all other components in y p (t) are low-pass. To model the effect of the 
receive filter at the reader, we introduce an ideal low-pass filter at the reader side, defined as 
hi(t) = 2W smc(2Wt), where W is the positive bandwidth of the low-pass equivalent signal. 
Additionally, all tag replies are amplitude modulated, thus only the envelope of the received 
signal contributes to the information in z'(t). The received low-pass envelope on the reader is: 



z(t) 



z'(r)hi(t - r)dr 



p-i 



^2 H RTR,pAT b C p (T - T p ) + HrrA + L ant + 0(t) 
.p=0 

III. Parameter Set Estimation 



hi(t — r)dr 



(12) 



After having developed a detailed model for a tag signal and of its output through the commu- 
nication channel, the next step is to derive the structure for the signal parameters estimation, as 
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shown in Fig. [21 Specifically, the estimation module described in this section estimates the link 
frequency, Tj p and delay, t p , of the strongest tag in the received signal, as defined in Eqn. (fT2l) . 

The information in a tag reply is encoded using the tag dependent control signal c p (t), which 
is true for all tags in the reply. For estimating the two parameters, link frequency and delay, it is 
important to have a-priori known information about the structure of the tag replies in z{t). The 
estimation procedure is designed to exploit the fact that in a reply all tags, independent of the 
link frequency and delay chosen by tag p, use the same structure in the preamble control signal 
c pr ,p(t) to control the absorb and reflect state during backscatter of the preamble in a reply. The 
structure in the preamble is the key used in the estimation framework introduced, where a mother 
function, ip(t), representing the preamble structure is designed. Then, a number of derivatives 
of this mother function is created, denoted as daughter functions, ^^{t), which are scaled (a) 
and translated (b) versions of the mother function. B A daughter function is defined as: 

lpa,b(t) = V f — ] (13) 



a 

Each daughter function is correlated with the received signal z{t) and the largest magnitude is 
used to estimate the frequency and delay of the strongest tag in the incoming signal. This 
approach is motivated by the fact that the mean of the received preamble signal may be 
approximated by: 

E[z(t)} = otil> ( ^ J + /3, b<t<b + D pr , (14) 

where a is an estimate of the signal level, is the offset added to remove the zero mean property 
of the mother function (more about this later), and D pr is the duration of the preamble. The 
expectancy operation averages across the white noise. 

The correlation framework is defined using the received signal and a daughter function: 



T(a,b) = (z(t),^ a , b (t)) = z(t)^ a , b (t)dt, (15) 

J — oo 

Calculation of T(a, b) for a range of a and b results in a three-dimensional representation, where 
a measure of the correlation of the received signal z(t) with various daughter functions are given. 
Similarly, if the scalogram E(a,b) = T 2 (a,b) is considered, then: 

(a p , b p ) = arg max E(a,b) (16) 
aeA, fees 



2 This framework is similar to that used in wavelet signal processing, see e.g. 1141 . 
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is the pair telling that it is very likely that tag p, with link frequency f t)P = j-, delayed b p is 
present in z(t). We use scalogram since if the channel has incurred a phase shift, the received 
signal has a large negative peak in the correlation representation. The search ranges for a and 
b, A and B, respectively, are defined in the standard and depend on the settings of the reader. 

The objective is to explicitly evaluate Eqn. (fl~6l) . The mother function is designed to capture 
the preamble structure in Section IIII-Al and the scaling and translation of the mother function 
leads to the definition of the daughter function in Section IIII-BL 

A. Mother Function 

Let ip(t) be the real valued mother function that satisfies the following two requirements: 

• if}(t) must have finite energy, i.e. ift 2 (i)dt < oo. This ensures that the correlation is 
bounded in time. 

• ip(t) must have no zero-frequency (DC) component in its support duration, i.e. it must 
be zero mean. Thereby the function is able to differentiate between signals based on their 
structure rather than their signal level. 

Furthermore, the mother function is designed such that each bit in a symbol has duration |s 
ensuring that the link frequency of the mother function is 1Hz, and so each symbol in the 
preamble has unit energy. Thanks to this normalization the link frequency of a daughter function 
(which is designed in the following section) becomes 1/a when evaluated with the scaling 
parameter a. The preamble structure consists of linear combinations of the basis functions derived 
in Section HH and the signal waveforms with unit energy are: 

Jt\t) = s r\t) = -fr\t). (i7) 

The mother function depends on whether FMO or Miller is used. Additionally, in the query sent 
by the reader, the parameter TRext specifies, which of two different preambles to use for a 
given encoding when a tag replies. Let N pr be the number of symbols in a preamble, and a 
4 x state select matrix S pr is generated in the same way as in Section [U from the preamble 



11 



structures in EPC Gen2 as: 



FMO,TRext=0 
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iMiller,TRext=0 
'pr 



lMiller,TRext=l 
pr 



ei ei ei ei e x e 2 e 3 e 4 e 2 e 4 



ei e x ... ei 



Miller, TRext=0 
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where e fc indicates the state Sk-i, or equivalently that signal waveform Sk-i(t) in Eqn. (TT7T ) is 
used to generate the mother function. With this in mind, let the square-wave modulated mother 
function be: 



Np r -1 3 



Np r -1 1 



2^ 2^<+i^n + isi-\t-nM)= ^ ^e^A+i^-^t-nM), (18) 

n=0 fe=0 n=0 fc=0 

where V is the signal constellation matrix from Eqn. ©. Notice that the preamble structure has 
zero mean and that the inherent memory structure of both FMO and Miller encoding is violated 
in the preambles. This ensures that it is not possible for the designed function to correlate as 
strongly to the data as to the preamble. 

A final consideration on the design is that z(t) contains the low-passed filtered signal. How- 
ever, since each daughter function is recalculated for each value of a and b, to contain the 
computational complexity we consider the daughter functions square-wave modulated. 



B. Daughter Function 

A daughter function of the mother function is defined as: 



ip a ,b(t) = w{a)ip 



t 



(19) 



where the scaling and translation parameters are used in the evaluation of the mother function, 
and where w(a) is a weighti, which ensures that all configurations of a daughter function are 
equally weighted and not biased by the parameter set (a, b) when matched onto the input signal 
in the correlation (z(t), ip a ,b{t))- The amplitude level for a tag when a bit is backscattered is the 
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same during communication no matter what link frequency is used. Thus the energy in a tag 
preamble oscillating with a fast frequency is less than the energy in a tag preamble oscillating 
with a slow frequency. The match constraint to determine w(a) therefore takes into account z(t). 

Lemma 1: (Proved in the Appendix) The weight ensuring that daughter functions are correctly 
scaled for all values of a and b in a reply containing only one tag reply is: 

w(a) = -. (20) 
a 

IV. Data Decoding 

The estimates of the link frequency - and the delay offset b obtained using the framework 
presented in the above makes it possible to decode one and possibly several tag replies in a 
received signal, even when they are dispersed in time and frequency. This corresponds to the 
tag resolution part shown in Fig. [2] and is treated in this section. 

One shortcoming of using UHF RFID as a use case for general multiple sensor decoding 
is that EPC Gen2 does not currently support multiple Ack commands after a collided reply, 
which clearly affects the possible gain in data decoding. In the description of the data decoding 
algorithm that follows, it is assumed that multiple Ack commands are allowed in the standard, 
to show the true potential of our method. 

To decode multiple tag replies an iterative, greedy algorithm, Successive Interference Cancel- 
lation (SIC), is used. Before explaining the SIC algorithm further, the next section derives the 
optimum decoder structure for a single tag, using the above described estimators. 

A. Optimized Single Tag Decoding 

The optimal algorithm for a detector, where the memory in a sequence satisfies the Markov 
property as the nth symbol in a sequence only depends on the (n — l)th decoded symbol, is the 
Viterbi algorithm lfl"5l . In addition to the memory structure in the encoding scheme, there are 
two additional a-priori known structures in EPC Gen2 to improve the decoding: 

1) The last symbol in the preamble and the signal waveform used to create it is a-priori 
known and are previously found in Eqn. ©. 

2) The postamble symbol after the data part is a-priori known to be symbol-1. 

Recall from Section llFCl that the structure containing both the memory, the data message M. and 



the postamble symbol is the A-by-(N M + 1) state select matrix S 
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where each Sj is a coordinate vector e k and k = 1,2,3,4 indicates which of the respective 
signal waveforms s (t), si(t), s 2 (t), and s 3 (i) is used to encode a symbol in the tag reply. The 
objective is therefore reformulated to estimate the state select matrix S as it contains the memory 
structure, encoded message, and postamble. 

Let the signal processed in the ith iteration be ri{t), with r (t) = z(t), and assume that it 
only contains one tag reply with parameters a and b. The expected value of rj(t) is then similar 
to the control signal c p (t) in Eqn. ©: 

N M 3 
n=0 k=0 

where ctj is the signal level at the reader side. Notice here that the daughter function has a 
superscript 1 attached. This is because this daughter function must be scaled to have unit energy 
per symbol, rather than to have correct scaling for all values of a and b in a reply containing only 
one tag signal. This results in changing the weight in Eqn. (|25l) from - to -?=. This change 
in scaling allows for the correct scaling afterwards to the signal level, aj. ji(t) is the offset 
introduced to ensure signal levels that correspond to the way a tag backscatters its reply (recall 
Eqn. ©), (3 is an estimate of the reader leak and D pr i = a,iMN pr is the estimated duration of 
the preamble. The signal waveforms: 

*?(*) = <f>o (t) sT (t) = <Pt (t) s%(t) = (t) # (*) = -#*(*). ( 22 ) 

follow from the symbol basis functions in Eqn. (OQ) and Eqn. CQ) and can be represented in terms 

of the signal space representation matrix V as: 

i 

^W = E e )+i Ve ^^W- (23) 

3=0 

Let the outcome of the MLSD be an estimate of S, denoted S, and consider the Viterbi 
algorithm as the black box for MLSD in Fig. |7J for optimal decoding it uses the three data 
structures: 

• Initial state vector s init — The initial state seeding the decoding which follows from the 
state corresponding to the last symbol in the preamble. For FMO, this state is s\ and for 
Miller it is s 3 , and the vector is defined in Eqn. ©. 

• A 4 x (N M + 1) matrix Z — Cost matrix not considering memory, where each element 
represents how well each of the four signal waveforms s&(i), k = 0,1, 2, 3, match a singled 
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out symbol part in the residual, where a large element value indicates a good match. Values 
are found for the N M = 16 data symbols and the postamble symbol. 
• A 4 x 4 matrix H — Memory structure representing the allowed paths between states 
observed for two adjacent encoded symbols. 
The memory structure matrix H is given by the valid state transitions. In the generation of 
the control signal at the tag in Section III-Cl the two matrices H mn=0 and H mn=1 are derived 
describing the transition to make, conditioned on the symbol m n to send. H follows as the 
version where the transmitted symbol is unknown: 

jjFA/O _ jjFAfO _|_ jjFA/0 jjMiZJer _ jjAftlier _|_ j^Miller 

and Hk,k' — 1 indicates that it is possible to go from state Sf-'-i to state Sk-i. 

The two matrices Z and H can be represented as a trellis, where each node (except the node 
in the preamble) represent the entries of Z and the possible transitions are given by the entries 
in H. The reason why there are only two possible states for the first symbol is that the previous 
state is known a-priori. The same applies for the postamble as it is known to be a symbol-1. 

It is useful to remark that, if the channel for the tag to be decoded in the ith iteration incurs 
a complete phase shift on the backscattered reply in the residual, this means that the state of the 
last symbol in the preamble is S3 instead of si for FMO, and si instead of S3 for Miller. The 
phase shift can be detected in several ways; for example if < or Tj(aj, bj) < a phase shift 
is introduced. In the sequel this effect is neglected to simplify the description of the decoding, 
however, it is important for an implementation to detect the phase shift and flip the initializing 
state in the decoder. 

B. Successive Interference Cancellation for Data Decoding 

The SIC algorithm works by iteratively estimating the strongest signal component in a signal 
consisting of multiple signal components. The estimated signal component is then subtracted 
from the others, and the next most strongest signal component is estimated and subtracted in 
the following iteration. The ith iteration of the algorithm is defined as follows: 

n+i(t) = n(t) - q(t), 

where r^t) is the current residual signal, q(t) is an estimate of the strongest signal component 
in ri(t) and r i+ i(t) is the resulting residual, used in the next iteration. 
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The ith estimate of the contribution to subtract from the residual follow from Eqn. (1271) where 
the estimate of the transmitted message S is used to model the tag contribution signal <&(t): 

N M 3 

qi(t) = a, (i> l a . ibi (t) + J2J2 e fc+iS e n+i • s k (t-k- D mi - na.M) + 7i (t) J , (24) 

where the zth modulation depth estimate a.^ follow from Lemma [3] in the Appendices and ^(t) = 
w 1 (a i )iect I — ^ — + a {N M +i) — )• The we ight, w 1 (ai), is to ensure unit energy per symbol of 
the signal before scaling with ctj. 

The algorithm is summarized as follows: Let T-L = {v , v 1 , . . .},Vi = (a^, 6«) be the history of 
estimates, assume z(t) contains only one tag reply, set r (t) = z(t) and let i = 0, then: 

1) Find the ith scalogram, as in Eqn. (fT5l) . and determine whether a phase shift has occurred. 

2) Find estimates for the strongest contribution in the newly found scalogram: 

Vi = (a h bi) = arg maxima, b) 

a&A, 
beB 

3) Decode a message from r^t) with location parameters (a i; bi) using the Viterbi algorithm 
described in the previous section. 

4) Generate an estimate of the complete tag contribution q^t) from Eqn. (1241) . 

5) Subtract the estimate from the residual = Tiit) — qi{t). 

6) Let H = {H, Vi} if H Pi Vi — {}, increment i, and re-iterate. 

The termination criteria for the algorithm depends on the scenario it is used in and what happens 
when an Ack is received at a sensor. In the case of UHF RFID tags, there are several possible 
methods, which are treated in the next section, which concerns the implementation of the Q- 
protocol, used in UHF RFID. 

V. Implementing Multiple Tag Decoding in the Q-Protocol 

With the algorithm concluding the previous section, it is possible to decode a single slot 
shown in Fig. [2l This section describes how this is extended to being part of an entire arbitration 
protocol run, using the Q-protocol of EPC Gen2. It is useful to evaluate the effect multiple tag 
decoding has on the Q-protocol, if acknowledging multiple tags is allowed. To enable this, the 
Q-protocol is implemented in MATLAB and Monte Carlo simulations are run, to determine how 
much time it takes to resolve an entire tag population and how many transmissions it takes. 
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The results can be used to evaluate the following (1) Is multiple tag decoding in the Q-protocol 
more time efficient than single tag decoding? (2) Is multiple tag decoding more energy efficient, 
with respect to transmissions count from the reader? 

In the numerical simulations the transmissions by both reader and tag are counted as listed 
in Tab. H Based on the duration of each of these commands and the three timeouts T\, T 2 and 
T 3 from the standard, the duration of the inventorying can be calculated. 

Design Assumptions for Q-Protocol Implementation 

Prior to an experiment, we assume a Select command has been issued and received correctly 
by all tags. This defines the scope of the experiment to inventorying of tags alone. Additionally, 
all tags have their inventoried flag for the selected session set to the same value, either A or B. 
This means that out of N tags, iV tags participate in the inventorying. 

For simplicity the command QueryAdjust is not used during a round to change the value of 
Q. Instead, a round is always completed with a chosen Q after which a new Query command is 
issued with a new value of Q. A QueryAdjust can increment or decrement Q during an inventory 
round, and when to use it during arbitration must be analyzed thoroughly first, to understand its 
effect on time and energy usage. We have therefore not optimized the Q-protocol for multiple 
tag decoding and we expect that a higher gain is achievable if this is done. 

It is assumed that the reader can determine perfectly whether a slot is idle or not and whether 
a slot contains any remaining tag signals. This is in order to focus on tag decoding, not on 
detecting whether there are tags to be decoded. In a future implementation, this detection could 
instead be done based on signal levels. As the variance of the noise can be estimated before 
tag to reader communication, it can be decided whether one or more tags are present, if enough 
samples cross a detection threshold based on the variance during tag to reader communication. 
This threshold may also be used for detecting when the residual in the SIC algorithm contains no 
more tags. In the case of UHF RFID tags an Ack transmission from a reader is quite expensive 
and should be avoided if possible. Otherwise, the termination criteria of the algorithm could be 
based on a digital decision, where the SIC algorithm terminates if no sensor replies the Ack. 

Because each iteration of the SIC algorithm is dependent on the previous iterations and the 
accuracy of the estimates of a, b and the signal level, the estimation of the signal level is assumed 
perfect in this implementation, to focus exclusively on the impact of multiple tag decoding based 
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on estimation of link frequency, a, and delay, b. Also, the reader is assumed to be unable of 
detecting a collision, it will always attempt to decode the strongest tag. If a tag is correctly singled 
out, the Ack and EPC are correctly transferred and decoded. This, to allow for simplicity and 
because if a tag RN16 has already been successfully singled out, the probability of error in 
receiving the Ack and transmitting the EPC is smaller. Additionally, we assume that the forward 
link (Reader-to-Tag) is error free, to be able to focus fully on multiple tag decoding. 

When a single tag is resolved, an extra frame is conducted, to ensure that no weaker tags are 
unresolved. For fair comparison, this is done for both the original and new reader. A change in 
the UHF RFID standard for the tags is assumed, namely that when a tag receives an Ack with 
a wrong RN16, it does not transition to state arbitrate, but remains in state reply. Only when a 
Query or QueryRep command is received does the tag transition to state arbitrate. This allows 
for multiple tag acknowledging by sending an Ack and receiving and decoding the EPC of the 
resolved tag in each iteration of the SIC, rather than only being able to send on Ack per slot. 

VI. Results 

To show the benefit of having multiple tag decoding in a reader, we have performed two 
simulation tests. The first test illustrates the probability of decoding a given number of tags. The 
second test is a comparison of the duration of an inventory round using the Q-protocol when 
using a reader with and without multiple tag decoding. The tests have been made for a scenario 
where the tags choose their link frequency according to a Gaussian distribution with mean equal 
to a nominal BLF of 50kHz and a variance such that 99.73% (3<r using the empirical rule) of 
the generated link frequencies and delays fall within the EPC Gen2 requirements. Also, a long 
preamble (TRext = 1), FMO encoding is used, the distance between reader and tags is set to 
1 meter and the low-pass filter employed has a bandwidth of 1.5MHz. The noise power at the 
reader antenna is set to — 50dBm. 

In the first test a number of experiments and runs are performed. A run is defined as the 
generation of a received signal, as in Eqn. (fl"2l) . and the following decoding of that signal. 
After decoding, the estimated message and the actual encoded message is compared, and if the 
decoding was incorrect, the run is marked as erroneous. An experiment is a series of runs. For 
the first test, 100 experiments, each containing 100 runs, has been performed. The results are 
shown in Fig. [81 where the gain for four different tag cardinalities, P = {2, 3, 4, 5} is illustrated. 
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The result of one experiment is used to calculate the percentage of runs ending in a given number 
of decoded tags. The results from all the experiments are then used to calculate the standard 
deviation of this statistic. The figure shows that it is possible to decode multiple tags, even 
when there are up to five tags present in the collision. Even though the total five tags are rarely 
decoded, the results show that in 50% of the cases some of the tags are decoded, which is a 
gain compared to presently used methods. 

In the other test, we implement the Q-protocol with multiple tag decoding and compare it to 
a normal reader, which only decodes one tag per slot. The initial value of Q is set to 4 and 1000 
runs are conducted. In each run, a randomly generated tag set is resolved using the Q protocol. 
The time it takes to resolve the tag set is found, by counting the transmitted commands, as 
specified in the previous section. The result is averaged over the 1000 runs and plotted in Fig. |9] 
The distribution of the commands is further elaborated on in Fig. [T0l As can be seen from 
Fig. multiple tag decoding decreases the duration of the inventorying, especially for a large 
number of tags. From Fig. [T0l it is clear that with multiple tag decoding, fewer Queries and 
Query Reps are sent. The RN16 count must be further explained. If more than one tag transmit 
their RN16 at the same time, there is a collision, which is counted as one RN16, as the duration 
is independent of the number of participating tags in the collision. The number of RN16s sent 
out has also decreased dramatically, as several tags can be decoded in one slot. The number of 
collisions has increased when using multiple tag decoding, but by a very small amount when 
compared to the savings. The number of idle slots and acknowledgements are roughly the same 
for both single and multiple tag decoding. The number of EPC is exactly identical, as is expected 
for full resolution. The reason why the difference is not larger in Fig. [9] is that the number of the 
most expensive transmissions, the acknowledgement, is unchanged. If the protocol is changed to 
allow for acknowledging multiple tags with a single composite Ack, the performance will greatly 
improve. Overall, the results show that multiple tag decoding does provide savings in time and 
energy and this gain increases approximately linearly, meaning that for tag populations in the 
hundreds and thousands, this would provide a significant increase in time and power efficiency. 

VII. Conclusion 

The concepts presented in this paper show that tag variability can be transformed from foe to 
friend, by using such differences to decode multiple colliding UHF RFID tag replies. By utilizing 
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the knowledge of the tag signal, it is shown to be feasible to distinguish between individual tag 
signals in numerical simulations. Also presented is a detailed mathematical model of the tag 
signals using standard signal representation techniques, which, to the knowledge of the authors, 
has not been presented in this level of detail before for both FMO and Miller encoding. The final 
tests where multiple tag decoding is incorporated into the Q-protocol shows a potential for time 
and transmission savings, in terms of fewer transmitted commands from the tags. 

Appendix A 

Proof of Weight Ensuring Correct Scaling of Daughter Functions 

Lemma 2: The weight ensuring that daughter functions are correctly scaled for all values of 
a and b in a reply containing only one tag reply is: 

w(a) = -. (25) 
a 

Proof: The following simplifications are made for the derivation: 

• Only one tag is assumed to be present in the reply z(t) with link frequency -, and the 
duration of the encoded preamble in the reply is D pr , that is, the tag preamble contributes 
to z(t) for b < t < b + D pr . 

• w(a) is determined for the pair a, b that leads to a maximum or minimum in Eqn. (fl"5T) . i. e. 
only the case where the duty cycle duration a for the encoded tag in z(t) and the duration 
of the mother function MN pr satisfy D pr = aMN, where M is the number of subcarrier 
cycles per symbol and N pr is the number of symbols in the preamble. 

The property to be satisfied is that a daughter function, when correlated with z(t) should satisfy 
a parameter independent correlation level: 

(E[ Zl (t)}^ aiM (t)) = {E[z 2 {t)\^ a , M {t)) (26) 

should be satisfied, where the tag reply in Zj(t) is encoded with different link frequency ^- and 
delay bj but with the same channel, thus the signal levels in z\(t) and z%(t) are equal. In the 
interval b < t < b + D pr the received signal z(t) has the property that its expected value can 
be written in terms of a weighted mother function with the same configuration as the control 
signal used to model z(t): 

E[z(t)] = ai; + P, b<t<b + D pr , (27) 
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where a controls the signal level and j3 the DC component in z(t). Rewriting, letting t' = — : 

E[z(at' + &)] = aip(t') + (3, < t' < MN pr . (28) 

As the signal levels in z{t) clearly does not depend on the encoding parameters a and b, then 
the correlation of Eqn. (1281) with the mother function is therefore constant for all encoded tag 
replies with different a and b. That is: 

(£[*iM + &i)],V(t)> = (E[z 2 (a 2 t + b 2 )}^(t)) (29) 

is a property that is always satisfied when zj(-) is encoded with parameters aj and bj, and thus is 
the property requested in Eqn. (|26|) . As the daughter function is a scaled and translated version 
of the mother function, combine Eqn. (|26l) and Eqn. (|29l) : 

E[z(at + &)]V>(t)tft = / E[z(t)]w(a)i/j l*-—) dt = w(a)a [ E[z(at' + b)]ip(t')dt' 



where t' — — and dt' = — , and the weight is w(a) = - which completes the proof. 



Appendix B 
Proof of Optimal Estimator of a 

Lemma 3: Aim for the lowest contribution for the tag with parameter configuration (a«, h) in 
the scalogram evaluated in the next iteration i + 1, then the optimal estimator for on is: 

/el,;-/ \ (flii bi 



oti = v V_ v - (30) 



MN pr 

where T^a*, &») is the Continous Wavelet Transform (CWT) value for the estimated and hi, 
M is the number of subcarrier cycles per symbol in the encoding scheme and N pr is the number 
of symbols in the preamble. 

Proof: The problem to be optimized is: 



cti = argmin E i+ i(ai,bi), 
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i. e. find the a { which minimizes the contribution in iteration + of the tag found in iteration 
i. The scalogram as a function of the ith residual and the contribution to be removed is: 

/ POO \ 2 / nOQ \ 2 

E l+l {a u b i )=( E[r i+1 (t))ip a . ibi (t)dt) = (/ -&(*)] ip ai A (t)dt 



2 



[E[n(t)] - Oityl^t) + 7i (*))] 1p ai ,bi(t)dt 



Ti(a h hi) -a { ip 1 b (t)ip atA (t)dt - a { %(t)il) aiM (t)dt ) = Ti(a u 6;) - 



a, 



MN pr 



= a ■ at \-Ei(ai,bi), (31) 

where i) follows as the support duration of ^,6^0 ensures mat Qi{t) is not evaluated in the part 
where the extra half symbol is added to and where ii) follows firstly because the daughter 
function is zero mean in the interval of the support duration of 7j(t) and secondly by evaluating 
the daughter function and the daughter function with unit energy per symbol in terms of the 
mother function: 

't-bi\ 1 r° ^MN pr 



TpltM^bMdt = w\a)w(a) / ^ 2 l - dt = -== / ^(t')dt' 



y/Mdi J-oo ^/d~ 

(32) 

Eqn. (|3TI) is a quadratic function where the quadric coefficient is positive, the function is convex, 

and thus its minimum is where the derivative is zero: 

dE i+1 { ai A) 2MiV 2 r 2 v / MA/ pr T,(a i ,& J ) 
1 = ai rr = 

Solving for ai completes the proof. ■ 
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Fig. 1: Signal level of a collided tag signal, with two participating tags, as measured by a 
reader. The nominal link frequency (BLF) is 44.44kHz, which is the reason for the small delay 
difference. The tags are synchronized to begin with, but differs later in the communication, as 
shown by the red circles. 
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Fig. 2: An overview of the following sections. The iterative process of tag resolution is also 
described together with data decoding in Section [IV] 
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Fig. 3: FMO preamble control signal with TRext = (short preamble). The bit sequence is 
{1, 0, 1, 0, v , 1}, where the v is a symbol breaking the encoding (more on this later). The ticks 
on the x-axis denote a symbol duration. 
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Fig. 4: Basis functions for FMO (M = 1) and Miller with M = 2. 
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Fig. 5: Control waveforms for FMO (top) and Miller with M = 2 (bottom) where so(t) and S2(t) 

A MT ltp \ 

encodes symbol-0 and si(t) and s 3 (t) symbol-1. 7 P (t) = ^rect I M7 ? — J is the offset added 
in the support duration. 
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FIGURES 




Fig. 6: State diagrams for FMO and Miller encoding. A and 1 indicates the symbol sent for 
the transition to take place, and Sk indicates the state representing the signal waveform s k ' p (t) 
used to encode a symbol. 
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Fig. 7: Black box illustration of a MLSD. A correlation matrix Z and a transition matrix H are 
used with the information of the initial state and possible end states to estimate the most likely 
transmitted state select matrix S. 
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Fig. 8: Numerically simulated collision resolution success probability for multiple tag decoding. 
The distance is 1 meter and P = {2, 3, 4, 5} tag replies are collided. 
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Fig. 9: The duration of the inventorying in the numerical simulation of multiple tag decoding 
the Q-protocol. 
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Fig. 10: The distribution of the transmitted commands during inventorying in the numerical 
simulation of multiple tag decoding in the Q-protocol. 
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Reader 


Name 


Contents other than payload 


No. of bits in payload 


Query 


delimiter, data-0, RTcal, TRcal 


22 


QueryRep 


delimiter, data-0, RTcal 


5 


Ack 


delimiter, data-0, RTcal 


18 


Tag 


Name 


Contents other than payload 


No. of bits in payload 


RN16 


preamble, postamble 


16 


PC/XPC + EPC + CRC 


preamble, postamble 


128 



TABLE I: Transmitted reader and tag commands H). 



